# Unified visual representation
Florence 2 Large Ft Safetensors
MIT
Florence-2 is an advanced visual foundation model developed by Microsoft, employing a prompt-based architecture to unify various vision and vision-language tasks
Image-to-Text
F
mrhendrey
162
2
Chat UniVi
Chat-UniVi is a large language model with unified visual representation that can understand both image and video content simultaneously.
Image-to-Text
Transformers

C
Chat-UniVi
12.10k
17
Featured Recommended AI Models